Genome-wide comparison of the protein-coding repertoire reveals fast evolution of immune-related genes in cephalochordates and Osteichthyes superclass
Authors
Abstract
Amphioxus is used to investigate the origin and evolution of vertebrates. To better understand the characteristics of genome evolution from cephalochordates to Osteichthyes, we conducted a genome-wide pairwise comparison of protein-coding genes within amphioxus (one comparable group) and parallel analyses within Osteichthyes (two comparable groups). A set of fast-evolving genes was identified in each comparable group. Of these, the fastest-evolving genes (top 20) were scrutinized, and most were involved in the immune system. Enrichment analysis showed that the fast-evolving genes were enriched in Gene Ontology (GO) terms and pathways primarily involved in immune-related functions. The same pattern was detected within Osteichthyes, where more numerous and better-known GO terms and pathways involving innate immunity were found than in cephalochordates. Next, we measured how the expression of four genes belonging to metabolism- or energy production-related pathways responded to lipopolysaccharide challenge in the muscle, intestine or skin of B. belcheri; three of these genes (HMGCL, CYBS and MDH2) showed innate immune responses. Additionally, some genes involved in adaptive immunity showed fast evolution in Osteichthyes, such as those in the "intestinal immune network for IgA production" or "T-cell receptor signaling pathway". This study establishes the fast evolution of immune-related genes in amphioxus and Osteichthyes, providing insights into the evolution of immune-related genes in chordates.
Similar resources
Transcriptome Sequencing of Guilan Native Cow in Comparison with bosTau4 Reference Genome
RNA sequencing is a new method for characterizing the transcriptomes of organisms. Based on identity and relatedness, there are large genetic variations among different cattle breeds. The goal of the current study was to sequence the transcriptome of the Guilan native cow and compare it with the available reference genome using RNA sequencing. Blood samples were collected from 14 Guilan native cows and...
Phylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467
It is now clear that protein is just one of the functional products of the eukaryotic genome. Indeed, more of the human genome is transcribed into non-coding sequences than into protein-coding sequences. In this study, we selected three long non-coding RNAs, namely AK082072, AK043754 and AK082467, which show brain expression and local region conservation among vertebr...
Long non-coding RNAs and their significance in human diseases
Protein-coding genes account for only a small fraction of the human genome and most of the genomic sequences are transcriptionally silent, but recent observations indicate significant functional elements, including non-coding protein transcripts in the human genome. Long non-coding RNAs (lncRNAs) have been defined as transcripts of >200 nucleotides without protein-coding capacity that perform t...
Bioinformatics Genome-Wide Characterization of the WRKY Gene Family in Sorghum bicolor
The WRKY gene family encodes a large group of transcription factors that regulate genes involved in plant response to biotic and abiotic stresses. Sorghum is a notable grain and forage crop in semi-arid regions because of its unusual tolerance against hot and dry environments. We identified a set of 85 WRKY genes in the S. bicolor genome and classified them into three groups (I–III). Among the ...
Papaya Dieback in Malaysia: A Step Towards A New Insight of Disease Resistance
A recently published article describing the draft genome of Erwinia mallotivora BT-Mardi (1), the causal pathogen of papaya dieback infection in Peninsular Malaysia, has significant potential to overcome and reduce the effect of this vulnerable crop (2). The authors found that the draft genome sequence is approximately 4824 kbp and the G+C content of the genome was 52-54%, which is very similar to t...